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The distribution of returns in financial time series exhibits heavy tails. In empirical studies, it 
has been found that gaps between the orders in the order book lead to large price shifts and thereby 
to these heavy tails. We set up an agent based model to study this issue and, in particular, how the 
gaps in the order book emerge. The trading mechanism in our model is based on a double- auction 
order book, which is used on nearly all stock exchanges. In situations where the order book is densely 
occupied with limit orders we do not observe fat-tailed distributions. As soon as less liquidity is 
available, a gap structure forms which leads to return distributions with heavy tails. We show that 
return distributions with heavy tails are an order-book effect if the available liquidity is constrained. 
This is largely independent of the specific trading strategies. 



I. INTRODUCTION 

A variety of stylized facts have been identified in empir- 
ical studies of financial markets [1] . A prominent example 
is the heavy-tailed distribution of stock price returns [2] . 
The precise shape of the tails has been examined in de- 
tail [2-7]. For return intervals smaller than one day, a 
power-law behavior fits the data well [8, 9]. Although 
the first empirical findings are roughly 50 years old, ex- 
planations for this effect are still subject to controversial 
discussions [10-18]. Following the most common reason- 
ing, the size of orders plays a crucial role in the emer- 
gence of the non-normal distributed returns [17, 19-21]. 
In contrast, Farmer et al. concluded from a detailed em- 
pirical investigation that the gaps between orders in the 
order book lead to the heavy tails [22]. Stochastic mod- 
eling can only describe this and other stylized facts, but 
is unable to provide a deeper understanding. 

Agent-based modeling provides means to trace back 
the emergence of stylized facts to the microscopic mech- 
anisms of trading and to the traders' behavior or strate- 
gies. In the past a variety of agent-based models were 
set up [23-29] . A prominent example is the Santa Fe Ar- 
tificial Stock Market [30], developed to study the emer- 
gence of trading strategies over time. In contrast there 
are models which are built solely to study certain aspects 
using small parameter sets [31-33]. We follow the latter 
approach; by keeping our model simple we are able to 
relate our parameters to empirical information. While in 
many models the price formation is the result of a bal- 
ance of supply and demand, the crucial mechanism in 
our setup is a double-auction order book. Our results 
support the view of Farmer et al. that the gaps in the 
order book are the prime reason for the heavy tails of the 
return distribution. 

The paper is organized as follows: In Sec. II, we lay 
out our agent-based model. The emergence of heavy- 
tailed return distributions within our model is discussed 
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in Sec. III. We conclude our findings in Sec. IV. 



II. MODEL 

We implement a double-auction order book and differ- 
ent types of traders, each following a fixed set of rules. 
The traders interact via the order book by submitting 
market or limit orders to buy or sell stocks. The or- 
der book is the crucial mechanism where the demand for 
stocks meets the available supply. 

The limit orders are stored in ascending order from 
the cheapest buy order to the most expensive sell or- 
der. Prices are discretized due to the tick-size, i.e, only 
discrete price levels are present in the order book. Limit 
orders which do not trigger a trade are stored in the order 
book, while marketable orders are cleared immediately. 

We use discrete time steps which correspond to simu- 
lation steps. In each simulation step an arbitrary number 
of traders can be active. An active trader is allowed to 
issue one order during this time step. After the trader 
finishes his trading activity, the amount of time steps 
to his next activity is drawn from a random distribution. 
For these waiting times i wt we choose an exponential dis- 
tribution 

p(*wt) = — exp(-i wt /,u wt ) with fJlwt = cN, (1) 

where N is the number of traders. With the scaling pa- 
rameter c, we calibrate the number of active traders dur- 
ing one time step to match empirical trade frequencies. 
We set c to achieve roughly 5.4 trades per minute. This 
corresponds to the average trade frequency if we look at 
the top 75% of stocks in the S&P 500 during the year 
2007. 

If more than one trader is active during a single time 
step, their orders are randomized to prevent serial cor- 
relations. Traders are free to choose a lifetime for their 
orders in accordance with their strategy. At the end of 
each time step orders that have expired are removed from 
the order book. By default the lifetime of an order is 
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drawn from an exponential distribution 

P(kt) = — exp(-ii t //Uit) , (2) 
Mit 

which results in an average order lifetime of In case 
of very small order lifetimes, the amount of limit orders 
in the order book becomes so small that the order book 
plays a minor role. In such a case the price formation is 
mainly driven by the specific behavior of the trader, in 
particular by the distribution used to determine the order 
price. In the following we only consider order lifetimes 
that are sufficiently large to avoid this problem, i.e., > 
40 time steps. 

The most basic trader in our model is called Random- 
Trader. He does not follow a strategy, his sell and buy 
decisions are completely random. Buy and sell orders 
are issued with the same probability. The trader places 
his limit orders with a price drawn from a normal dis- 
tribution centered around the corresponding best price, 
i.e., best ask in case of sell orders and best bid in case 
of buy orders. The size of an order v is drawn from an 
exponential distribution, 

p(v) = —— cxp(-w/^ vol ) with fi vo \ = 10 . (3) 

Mvol 

The volumes drawn from this distribution can be scaled 
by multiplying them with k. In case of the RandomTrader 
the scaling parameter k is equal to one. The distribution 
and its parameter /i vo i have been chosen to roughly fit 
empirical order sizes found in the S&P 500 index for the 
year 2007. 

To study the effects of larger order sizes we use the 
BigTrader, which is a RandomTrader with an additional 
parameter k to scale the order size. 

To avoid unnecessary boundary conditions we allow 
short-selling and give an unlimited credit line to each 
trader. While short-selling is allowed in reality we as- 
sume an unlimited credit line for the following reasons. 
The individual RandomTraders are indistinguishable from 
each other due to their entirely random actions. We do 
not aim at a one-to-one correspondence between our sim- 
ulated and real traders and it is not our goal that traders 
evaluate and optimize their strategy. For an external ob- 
server all incoming orders look random and therefore the 
actual strategies of the market participants do not play 
any role. The power of this approach is discussed in [34] . 
In this sense the random character represents the typi- 
cal behavior of traders. Therefore we may assume that 
on average all RandomTraders own the same amount of 
money, which in our case is the average of their accumu- 
lated credit debt. As a consequence the results do not 
depend on the number of traders in case of the Random- 
Trader. Nevertheless a minimum of traders is required so 
that a wide range of waiting times, drawn from the dis- 
tribution given in Eq. (1), is present at any time during 
the simulation run. 



III. RESULTS 

We conducted the simulations with 300 Random- 
Traders. The scale of the waiting time c was adjusted so 
that all simulations have roughly 5.4 trades per minute 
which corresponds to typical empirical time scales for 
frequently traded stocks. Other values of c give similar 
results. We use traded prices s(t) to calculate the one- 
minute (At = 60 time steps) returns 

r(t) = + ( 4 ) 
s(t) 

for each trading day. For these returns we calculate the 
standard deviation 

a = sjwfh ( 5 ) 

and the mean \x = (r(t)) T for each simulation with a 
length of T = 5 • 10 5 time steps. Next, we calculate the 
normalized returns 

m = rJ ^ , (6) 

which are independent of the simulation length T for 
large T. It is also worth noting that the simulations 
are stable for the simulation length used here, i.e. the 
price fluctuates around the starting price for a time much 
greater than T. 

In Fig. 1 the probability density functions for normal- 
ized returns g of six simulation scenarios are shown for 
different values of the order lifetime /xi t . The probability 
distributions are generated from 5000 simulations with 
different random seeds. For small order lifetimes (around 
40 < fi\t < 400 time steps) we observe a fat-tailed re- 
turn distribution, while for larger order lifetimes the re- 
turn distribution approaches a normal distribution. The 
distribution of volatilities p(a) for an order lifetime of 
(i\t = 120 time steps calculated for a moving window of 
T = 1000 time steps is shown in Fig. 2. Previous stud- 
ies showed that empirical found volatilities are described 
quite well by a log- normal distribution [35, 36] which is 
shown for comparison. Clearly, due to the simplicity of 
our model, we cannot expect a full agreement with the 
empirical distribution. It is rather an encouraging cor- 
roboration of our approach that we get a qualitatively 
similar distribution. 

In our model the lifetime of an order is a way to in- 
directly adjust the available volume in the order book. 
Large order lifetimes lead to a saturation of the order 
book in which nearly all discrete price levels are occu- 
pied. On the other hand, for small order lifetimes less 
price levels are occupied. This leads to larger gaps be- 
tween occupied levels, which is shown in Fig. 3. We use 
the kurtosis excess 




FIG. 1: The figures show the probability distribution of normalized one-minute returns g for six simulation scenarios 
which differ only by their order lifetime A normal distribution (red, dashed) is shown for comparison. 
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FIG. 2: The plot shows the probability distribution of the volatilities p(cr). The volatilities were calculated for a 
moving time window of T = 1000 time steps using one-minute returns from simulations with an order lifetime of 

/in = 120. A log- normal distribution is given for comparison. 
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FIG. 3: Two typical snap shots of the order book, (a) Fewer discrete price levels are occupied and large gaps exist 
between the occupied levels. We assigned a negative sign to the volume of buy orders to better distinguish them 
from the sell orders. The midpoint is shown as a dotted line, (b) The order book is much more dense and the 
distance between occupied levels is much smaller compared to (a). 
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FIG. 4: The plot shows the excess kurtosis versus the average volume per day stored in the order book for different 
order lifetimes. Note that the abscissa has a logarithmic scale. The dots from left to right correspond to an order 
lifetime of Wt = 120, 130, 140, 150, 160, 170, 180, 190, 200, 220, 240, 260, 280, 300, 350, 400, 450, 500, 600, 700, 800, 

900, 1000, 1200, 1800 and 3600 time steps. 



where \i\ is the fourth central moment, as a measure how 
much the tail of a distribution deviates from a normal 
distribution. For mesokurtic distributions, e.g., the nor- 
mal distribution, the kurtosis excess is zero. Leptokurtic 
distributions whose kurtosis excess is greater than zero 
have more pronounced tails compared to a normal dis- 
tribution. In Fig. 4 the kurtosis excess of the return 
distribution is plotted versus the average volume per day 
in the order book. The dots from left to right correspond 
to different waiting times, starting with = 120 and 
ending with fi\ t = 3600 time steps. We clearly see a de- 
cline of the kurtosis excess from 14 towards zero when 
the volume in the order book increases. The decrease of 
the kurtosis excess is in line with the distributions shown 
in Fig. 1, where the fat-tails vanish towards larger order 
lifetimes fi\t . We therefore note that in situations of high 
liquidity the fat-tails disappear. 

To further quantify the effect of gaps within our model 
we study the virtual market impact function introduced 
in Ref. [22]. It describes how the price would change 
given a hypothetical market order of volume v. Thus 
it is a test of the volume dependence of the price shift 
distribution. We define a supply and demand function 

l 

S(l,t)= 

i= : (t) (8) 

D(l,t)= V ( i ' t ) . 
i=b(t) 

where V(i, t) is the available volume at the discrete price 
level i. The supply and demand functions represent the 



total volume stored in the order book which is available 
to sell or buy up to price I starting from the best ask 
a(t) or best buy b(t). The inverse functions of S and D, 
l(S,t) and l(D,t), are the virtual market impact func- 
tions. The inverse functions do exist because the supply 
and demand functions in Eq. (8) are monotonically in- 
creasing. For example, a buy market order of size S will 
produce a price shift of As(t) = l(S,t) — a(t). After 
calculating the price shifts As(t) for all time steps for 
a fixed volume v we can work out the distribution and 
the cumulative distribution P(As). By comparing dis- 
tributions for different volumes v we can see whether the 
volume v changes the shape of the cumulative distribu- 
tion and therefore see the influence of order size on the 
price shift distribution. 

We now calculate the virtual market impact functions 
for the simulations containing only RandomTraders with 
an order lifetime of 120 and 1200 time steps. Figure 1 
shows that the return distribution for [i\ t = 120 time 
steps is fat-tailed while for /j,\ t = 1200 time steps the dis- 
tribution approaches a normal distribution. We have to 
constrain the number of simulations to 100 each due to 
the large storage requirements of order book data. We 
choose the volumes v = 3, 10, 700, 1100 so that they rep- 
resent the 0.1,0.5,0.9 and 0.99 quantiles of the traded 
volume given a scenario that contains 30 BigTraders with 
k = 5. We thus predict what would happen if we added 
traders who issue orders with larger sizes to the simula- 
tion. In Fig. 5a we see the virtual market impact func- 
tions for the first case, where we observed fat-tails. If 
we assume power law behavior, the linear parts will be 
parallel to each other, i.e., they are independent of the 
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FIG. 5: Complementary cumulative distribution 1 — P(As) of the price shifts As(t) for different volumes which 
correspond to the 0.1 (red, solid) 0.5 (blue, dotted) 0.9 (black, dashed) and 0.99 (orange, dashed-dotted) quantile of 
the traded volume in a double logarithmic plot. In (a) the functions for the 0.9 and 0.99 quantile coincide. 



order size v and therefore are purely the result of gaps. 
However, for the scenario with an order lifetime of 1200 
time steps we notice that the virtual market impact func- 
tions look different for large volumes that match the 0.9 
and 0.99 (dashed, dashed-dotted) quantile of the traded 
volume in Fig. 5b. 



The different shapes in case of an order lifetime of 1200 
time steps hint at a volume dependence. If we set up 
a new simulation scenario with 300 RandomTraders and 
30 BigTraders, who trade a volume that is n > 5 times 
larger than the average volume of the RandomTrader, we 
indeed observe fat-tailed return distributions. One might 
be tempted to conclude that large volumes can also be the 
cause of heavy tails, but this is not the full explanation. 
As the trader places his orders around the corresponding 
best price using a normal distribution, the probability is 
much higher that orders are placed near the best price 
than deeper in the order book. Therefore more volume is 
stored around the best prices, while deeper in the order 
book there is less volume and the gaps become larger. 
This is seen in Fig. 3b especially for the sell orders for 
prices higher than 101.5 units. The larger orders match- 
ing the 0.9 and 0.99 quantile of the traded volume dig 
deep into the order book, reaching those lesser occupied 
levels and gaps. If we lower the volume multiplier k, the 
orders hit less and less gaps deep in the order book, and 
the return distribution approaches the normal distribu- 
tion. 



IV. CONCLUSION 

In the framework of an agent-based model we iden- 
tified the gap structure in the order book as the prime 
reason for extreme price changes. These gaps can arise 
due to different reasons: older limit orders being can- 
celled automatically or manually, or traders placing limit 
orders far away from the current midpoint. In general, 
the more liquidity is provided by limit orders in the or- 
der book, the less likely are extreme price shifts - even 
when market orders with large volumes are submitted. 
The fat-tailed return distributions observed in empirical 
data reflect that, compared to the total number of shares 
outstanding, only very small volumes contribute to the 
price formation at any given time. 

One mechanism which produces gaps is the finite life- 
time of limit orders. If this lifetime is comparable to 
the rate at which new orders are placed, a gap struc- 
ture arises which leads to non-Gaussian return distribu- 
tions. Further assumptions about the trader behavior are 
not necessary. We demonstrated this by only consider- 
ing RandomTraders which place their orders with a limit 
price drawn from a normal distribution around the best 
price. Still, we observe return distributions with heavy 
tails. 

By using only traders that act randomly and do not use 
a strategy the prices in our model become Markovian. In 
this regard our traders depart from reality where prices 
follow a non-Markovian process [37], because there is a 
certain amount of traders pursuing a strategy over time. 
While it is possible to construct such traders we prefer 
to keep the model simple. From this viewpoint it is re- 
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markable and encouraging that our scenario yields the 
macroscopic observables in good qualitative agreement 
with the empirical data. In addition, we identified situa- 
tions in our model where large order volumes yield heavy- 
tailed return distributions. In this case, too, we can trace 
back the extreme price shifts to gaps that lie deep in the 
order book. The probability that orders are placed far 
away from the current best price is low and therefore 
not much volume exists deeper in the order book. These 
limit orders can only be reached by orders with very large 
volume. 

Our results support the view of Farmer et al. [22]. In 
low liquidity situations, i.e., with little volume in the or- 
der book, price gaps between limit orders are even rel- 



evant close to the current midpoint. Hence, even small 
market orders can cause large price changes. In more 
general terms, the less market participants are involved 
in the price formation process by providing limit orders 
to buy and sell, the more likely it becomes to find devi- 
ations from the purely diffusive price dynamics. 
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